Towards matching user mobility traces in large-scale datasets
Abstract
The problem of unicity and reidentifiability of records in large-scale databases has been studied in different contexts and with different approaches, with a focus on preserving privacy or on matching records from different data sources. With an increasing number of service providers now routinely collecting location traces of their users on unprecedented scales, there is a pronounced interest in the possibility of matching records and datasets based on spatial trajectories. Extending previous work on reidentifiability of spatial data and trajectory matching, we present the first large-scale analysis of user matchability in real mobility datasets on realistic scales, i.e., between two datasets that each consist of several million people's mobility traces over a one-week interval. We extract the relevant statistical properties that influence the matching process and provide an estimate of matching performance, and thus of the matchability of users. We derive that for individuals with typical activity in the transportation system (those making 3-4 trips per day on average), a matching algorithm based on the co-occurrence of their activities is expected to achieve a 16.8% success rate based only on a one-week observation of their mobility traces. Extrapolating to longer time intervals, we expect a success rate of over 55% after four weeks of observation. We further evaluate different scenarios of data collection frequency, giving estimates of matchability over time in several realistic cases of mobility datasets.
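The abstract does not spell out the matching algorithm, so the following is only a minimal sketch of what matching by co-occurrence could look like, not the authors' method. It assumes each trace is a list of `(place, timestamp)` events, treats two events as co-occurring when they fall in the same place and the same one-hour bin, and assigns each user in the first dataset to the user in the second dataset with the most shared bins. The function names, the `time_bin` parameter, and the toy data are all hypothetical.

```python
from collections import Counter, defaultdict


def bin_events(traces, time_bin=3600):
    """Map each (place, time bin) cell to the set of users observed in it."""
    cells = defaultdict(set)
    for user, events in traces.items():
        for place, timestamp in events:
            cells[(place, timestamp // time_bin)].add(user)
    return cells


def match_by_cooccurrence(traces_a, traces_b, time_bin=3600):
    """For each user in A, pick the user in B sharing the most cells.

    Returns None for a user when there is no candidate or when the
    best candidate is tied with the runner-up (assumed tie policy).
    """
    cells_b = bin_events(traces_b, time_bin)
    matches = {}
    for user_a, events in traces_a.items():
        # Count each cell once, even if user_a has several events in it.
        seen = {(place, ts // time_bin) for place, ts in events}
        counts = Counter()
        for cell in seen:
            for user_b in cells_b.get(cell, ()):
                counts[user_b] += 1
        top = counts.most_common(2)
        if not top or (len(top) == 2 and top[0][1] == top[1][1]):
            matches[user_a] = None
        else:
            matches[user_a] = top[0][0]
    return matches


# Toy data: timestamps in seconds, place labels are made up.
traces_a = {
    "a_1": [("stop_1", 100), ("stop_2", 4000)],
    "a_2": [("stop_3", 200)],
}
traces_b = {
    "b_1": [("stop_1", 500), ("stop_2", 4200)],
    "b_2": [("stop_1", 900), ("stop_3", 300)],
}
result = match_by_cooccurrence(traces_a, traces_b)
print(result)
```

In this toy case `a_1` shares two cells with `b_1` and only one with `b_2`, so it is matched to `b_1`. On real data with millions of users, many candidates share a few cells by chance, which is why the length of the observation window matters for the success rate.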
Related articles
Predict User In-World Activity via Integration of Map Query and Mobility Trace
People often resort to map search engines or other location-based services for location information when planning long trips or local navigation, and their map queries as well as mobility traces will be accumulated and stored in user logs. These data offer valuable information for studying the mechanism of human mobility patterns; furthermore, map query data enable us to sense users’ real-time inte...
You Are How You Move: Linking Multiple User Identities From Massive Mobility Traces
Understanding the linkability of online user identifiers (IDs) is critical to both service providers (for business intelligence) and individual users (for assessing privacy risks). Existing methods are designed to match IDs across two services, but face key challenges of matching multiple services in practice, particularly when users have multiple IDs per service. In this paper, we propose a no...
Analyzing Mobility-Traffic Correlations in Large WLAN Traces: Flutes vs. Cellos
Two major factors affecting mobile network performance are mobility and traffic patterns. Simulations and analytical-based performance evaluations rely on models to approximate factors affecting the network. Hence, the understanding of mobility and traffic is imperative to the effective evaluation and efficient design of future mobile networks. Current models target either mobility or traffic, ...
Characterizing User Behavior and Network Load on a Large-Scale Wireless Mesh Network
Wireless mesh networks represent a promising paradigm to provide a scalable infrastructure for Internet access in metropolitan areas. In this paper, a large-scale wireless mesh testbed deployed in three cities in the Trentino region is described, and experimentation results obtained from the public use of the testbed are reported and analyzed. The large scale of the deployment and high number of...
MobReduce: Reducing State Complexity of Mobility Traces
User traces are essential for analysis of human behavior and development of opportunistic networking protocols and applications. As user traces are collected with high granularity to apply them in diverse scenarios, they have a high complexity resulting from the large number of user states. We present MobReduce: a methodology for reducing the number of states in user traces. We apply MobReduce ...
Journal: CoRR
Volume: abs/1709.05772
Publication year: 2017